PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v3.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Csa07g010290.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Camelineae; Camelina
Family HD-ZIP
Protein Properties Length: 734aa    MW: 80466.1 Da    PI: 5.7423
Description HD-ZIP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Csa07g010290.1genomeCSGPView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Homeobox56.93.6e-1874129156
                     TT--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
        Homeobox   1 rrkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 
                     r+k  +++++q++e+e++F+++++p+ ++r++L ++lgL   q+k+WFqN+R++ k
  Csa07g010290.1  74 RKKYNRHSQYQIHEMEAFFKECPHPDDKQRRDLGRQLGLAPVQIKFWFQNKRTQNK 129
                     799999***********************************************998 PP

2START170.31.2e-532574782206
                     HHHHHHHHHHHHHHHC-TT-EEEE....EXCCTTEEEEEEESSS......SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT-S....EEEEEE CS
           START   2 laeeaaqelvkkalaeepgWvkss....esengdevlqkfeeskv.....dsgealrasgvvdmvlallveellddkeqWdetla....kaetle 83 
                     la  a++el+++a+++ep+W+  +      +n de+ ++f ++ +     +s+ea+r++++v m+++ +ve l++ +  W++++     +a t+e
  Csa07g010290.1 257 LAIGAMEELLLMAQVGEPLWMGGVdgtsLALNLDEYARTFRKGLGprlsgFSIEASRETALVAMNPTGVVEMLMQAN-LWSTMFVgmvgRAITHE 350
                     67889****************9998877789*********88776********************************.***************** PP

                     EECTT......EEEEEEEEXXTTXX-SSX.EEEEEEEEEEE.TTS-EEEEEEEEE-TTS--.-TTSEE-EESSEEEEEEEECTCEEEEEEEE-EE CS
           START  84 vissg......galqlmvaelqalsplvp.RdfvfvRyirqlgagdwvivdvSvdseqkppesssvvRaellpSgiliepksnghskvtwvehvd 171
                     ++ ++      galq m+ae+q+lsplv+ R+++fvRy++q+g+  w++vdvS+d+  ++      ++++++pSg+li++ +ng+skvtwvehv+
  Csa07g010290.1 351 KLLTDvagnfnGALQIMSAEYQVLSPLVStRESYFVRYCKQQGENLWAVVDVSIDHLFPNIH----MKCRRRPSGCLIQEIPNGYSKVTWVEHVE 441
                     *********************************************************99975....999************************** PP

                     --SSXX..HHHHHHHHHHHHHHHHHHHHHHTXXXXXX CS
           START 172 lkgrlp..hwllrslvksglaegaktwvatlqrqcek 206
                     +++r    +++++++++sg+a++a++wvatl+rqce+
  Csa07g010290.1 442 VDDREAanQNIFKHFISSGQAFAANRWVATLERQCER 478
                     ***99989***************************97 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SuperFamilySSF466891.02E-1766135IPR009057Homeodomain-like
Gene3DG3DSA:1.10.10.605.9E-1968136IPR009057Homeodomain-like
PROSITE profilePS5007116.22871131IPR001356Homeobox domain
SMARTSM003892.9E-1773135IPR001356Homeobox domain
CDDcd000865.98E-1774132No hitNo description
PfamPF000469.6E-1674129IPR001356Homeobox domain
PROSITE profilePS5084837.933247481IPR002913START domain
SuperFamilySSF559616.6E-31248480No hitNo description
CDDcd088756.60E-113252477No hitNo description
SMARTSM002348.4E-44256478IPR002913START domain
PfamPF018521.7E-44257478IPR002913START domain
Gene3DG3DSA:3.30.530.207.4E-6308442IPR023393START-like domain
SuperFamilySSF559615.5E-14501690No hitNo description
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0048825Biological Processcotyledon development
GO:0003677Molecular FunctionDNA binding
GO:0008289Molecular Functionlipid binding
Sequence ? help Back to Top
Protein Sequence    Length: 734 aa     Download sequence    Send to blast
MSEPNMVPVD NNGDNDNNEN NDMNNTDGGL YNTNGGAGAG VEAGVGAEEI DSARTVSDSR  60
EEEMGSDQGP PRKRKKYNRH SQYQIHEMEA FFKECPHPDD KQRRDLGRQL GLAPVQIKFW  120
FQNKRTQNKN HQERCENTEL RSLNSKLRSE NERFREAVHL ALCPKCGGKT AIGEMSFEEH  180
HLRIVNARLN EEINELPALA VRFSSKAVIS YPAISPRPSN HPPTFEFGAG SSSGSGGNLS  240
RGITGPADVD TPMIMELAIG AMEELLLMAQ VGEPLWMGGV DGTSLALNLD EYARTFRKGL  300
GPRLSGFSIE ASRETALVAM NPTGVVEMLM QANLWSTMFV GMVGRAITHE KLLTDVAGNF  360
NGALQIMSAE YQVLSPLVST RESYFVRYCK QQGENLWAVV DVSIDHLFPN IHMKCRRRPS  420
GCLIQEIPNG YSKVTWVEHV EVDDREAANQ NIFKHFISSG QAFAANRWVA TLERQCERIA  480
SITTTDFQAV DSPDHLVLTG HGKTSILKLA ERVTRSFFVG LTSSMGTTFS GVGGDDIRVM  540
TMKNINDPGR PPGVVFSAAT SFWVPAPPKI VFDFLRDVEH RASWDVLCAG GVVHKISEIA  600
NGRDSRNCAT LLRNEIPCEK KMMIIQETST DPTASFVIYA PVDTASIEGV LSEGQDPDYV  660
ALLPSGFAIL PDGMGDQPGG SLLTVSFQML VEEVHSGKLS ITSVATVENL IRTTVLRIKA  720
LFPFQIATPS LGK*
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
17175RKRKK
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_010414006.10.0PREDICTED: homeobox-leucine zipper protein HDG3-like isoform X1
SwissprotQ9ZV650.0HDG3_ARATH; Homeobox-leucine zipper protein HDG3
TrEMBLR0HIB70.0R0HIB7_9BRAS; Uncharacterized protein
STRINGAT2G32370.10.0(Arabidopsis thaliana)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM49128149
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT2G32370.10.0homeodomain GLABROUS 3